RESERVOIR COMPUTING WITH TRUNCATED NORMAL DISTRIBUTION FOR SPEECH EMOTION RECOGNITION

نویسندگان

چکیده

Speech is an effective, quick, and important way for communicating exchanging complex information between humans. Emotions have always been a part of normal human conversation which makes the speech more attractive. Because this major role both emotion, many researchers are inspired by studying Emotion Recognition (SER) still has plenty challenges. In study, we proposed novel reservoir computing approach with initialization random connection weights input weight truncated distribution. Furthermore, Population-Based Training (PBT) adopted to optimize hyperparameters whole Echo State Network (ESN) model significant impact on performance. The bidirectional increase memorization capability, Sparse Random Projection (SRP) was applied dimensional reduction as simple, unsupervised, low complexity approach. speaker-independent strategy employed EMODB SAVEE datasets acted emotion dataset Aibo non-acted dataset. achieved 84.8%, 65.95%, 45.99% unweighted average recalls EMODB, SAVEE, respectively. results show that outperforms recent state-of-the-art studies cheaper computational cost.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Speech Emotion Recognition with Emotion-Pair Based Framework Considering Emotion Distribution Information in Dimensional Emotion Space

In this work, an emotion-pair based framework is proposed for speech emotion recognition, which constructs more discriminative feature subspaces for every two different emotions (emotion-pair) to generate more precise emotion bi-classification results. Furthermore, it is found that in the dimensional emotion space, the distances between some of the archetypal emotions are closer than the others...

متن کامل

Neuro-Inspired Speech Recognition Based on Reservoir Computing

This chapter investigates the potential of recurrent spiking neurons for classification problems. It presents a hybrid approach based on the paradigm of Reservoir Computing. The practical applications based on recurrent spiking neurons are limited due to the lack of learning algorithms. Most of the previous work in the literature has focused on feed forward networks because computation in these...

متن کامل

Fuzzy Truncated Normal Distribution with Applications

In this paper, fuzzy truncated normal distribution is shown. Also, some examples and applications are presented.

متن کامل

Speech Emotion Recognition Using Scalogram Based Deep Structure

Speech Emotion Recognition (SER) is an important part of speech-based Human-Computer Interface (HCI) applications. Previous SER methods rely on the extraction of features and training an appropriate classifier. However, most of those features can be affected by emotionally irrelevant factors such as gender, speaking styles and environment. Here, an SER method has been proposed based on a concat...

متن کامل

A Database for Automatic Persian Speech Emotion Recognition: Collection, Processing and Evaluation

Abstract   Recent developments in robotics automation have motivated researchers to improve the efficiency of interactive systems by making a natural man-machine interaction. Since speech is the most popular method of communication, recognizing human emotions from speech signal becomes a challenging research topic known as Speech Emotion Recognition (SER). In this study, we propose a Persian em...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Malaysian Journal of Computer Science

سال: 2022

ISSN: ['0127-9084']

DOI: https://doi.org/10.22452/mjcs.vol35no2.3